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ABSTRACT 

Wc present the largest, most homogeneous catalogue of merging galaxies in the nearby 
universe obtained through the Galaxy Zoo project - an interface on the world-wide web 
enabling large-scale morphological classification of galaxies through visual inspection 
of images from the Sloan Digital Sky Survey (SDSS). The method converts a set of 
visually-inspected classifications for each galaxy into a single parameter (the 'weightcd- 
mcrger-vote fraction,' f m ) which describes our confidence that the system is part of 
an ongoing merger. We describe how f m is used to create a catalogue of 3003 visually- 
selected pairs of merging galaxies from the SDSS in the redshift range 0.005 < z < 0.1. 
We use our merger sample and values of f m applied to the SDSS Main Galaxy Spectral 
sample (MGS) to estimate that the fraction of volume-limited [M r < —20.55) major 
mergers (1/3 < M^/Mj < 3) in the nearby universe is 1 — 3 x C% where C ~ 1.5 
is a correction factor for spectroscopic incompleteness. Having visually classified the 
morphologies of the constituent galaxies in our mergers, we find that the spiral-to- 
elliptical ratio of galaxies in mergers is higher by a factor ~ 2 relative to the global 
population. In a companion paper, wc examine the internal properties of these merging 
galaxies and conclude that this high spiral-to-elliptical ratio in mergers is due to a 
longer time-scale over which mergers with spirals are detectable compared to mergers 
with ellipticals. 
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1 INTRODUCTION 



Galaxy mergers and interactions are connected to many 
pressing questions concerning the origin, evolution and 
properties of cosmic s tructures. These issues include the 
form a tion of galaxies (I White fc Reed 1 19781; lLacev fc Cold 
1 19931 : IConselicd l2006al : be Lucia et al l 1200611. cnvironmen- 



tal e f fects on morpho l ogy dCapak et al.1 |2007l ; iPark et al.l 
120071 : iBall et al.1 |200St IVan der Well 120081 ) and the distri- 
bution of the dark matter that dr i ves the merger proces s 
(|Bond et al.lll99ll : ICole et al.l I2OO0I : iFakhouri fe Mai |200S| ). 

At the sub-galactic scale mergers have been invoked to ex- 
plain a variety of observations, notably localised bursts of 

star formation rtKennicutt et al.lll98 7: Schw eizer et al. 2005; 
„ , . . : ,11 1 l„, , „ „ ,, II 1 l„ ,1 



Pi Matteo et al.ll2005l; IWoods fe Gellerl 120071 ; iBarton et all 



20071 : ILi et al.l 120081; Cox et al.l 120081) and induced n uclear 
activ ity (|Keel et al.l Il985l : ISchawinski etahl l2007al : IJoged 
120081 ). The far-reaching effects that mergers are thought to 
produce makes their empirical examination an important 
task. 

To date tho ugh, such studies hav e concentrated mostly 
on merger rates (I Car lb erg et al. 2000; Le Fevre et al. 200C ; 



Patton et al. ll2002i;IConselice et al. II2OO3I; iBundv et aljbood: 



Bell et al.ll2009; IConselice, Rajgor fc Myerdl2008l; iLin et al 
20081 ; lLotz et al.l l2008al ; Iflsieh. et al.l |2008| . IPatton et a! 



20081 ). with comparatively little carried out to exam ine their 



morphologies and intern al properties (though see ILi et al.l 
|200S| ; lEllison et al.ll200sl ) for reasons to be discussed (JEHJ. 
This is unfortunate - to understand the exact role played by 
mergers in galaxy evolution, it is inadequate to measure their 
rates alone since the processes that determine the morpho- 
logical outcomes of mergers are still not fully understood. 

It has been w i dely believed since the simulations of 
iToomre fc Toomrel (| 19721 ) that two spirals can merge to 
form an elliptical but this does not mean that they must. 
On the contrary, recent studies have argued convincingly 
that, in some cases at least, disc galaxies are able to sur- 
vive (multiple) major mergers (|Hopkins et al.|[2009h . Where 
this does happen, the probability of disc survival must be as- 
sumed a priori to depend on the properties of its progenitors 
such as environment, gas content and feedback mechanisms. 
These in turn correlate with galactic morphology. There is 
also good reason to think that the time-scales over which 
mergers are detect able depends on the internal properties 
of the progenitors (|Lotz et al.ll2008bT ). Ideally then, calcula- 
tions that convert merger fractions into merger rates as part 
of some hierarchical-struct ure scheme (such as modeled by 
iKhochfar fc BurkertJ I2003T ) should take the properties and 
morphologies of their sample into account. Only then can 
models of hierarchical galaxy formation be fully tested. 

The Galaxy Zoo project contributes to this important 
task by providing a snapshot of mergers as they appear 
in the local universe (in this study we focus on the range 
0.005 < 2 < 0.1). Plans are also underway to apply the 
same merger-locating system to higher redshift surveys. In 
this paper we present a simple but powerful method for 
identifying mergers using the world-wide web and a robust 
sample of 3003 merging systems obtained by it f t)2.H) Fl We 
then discuss three important ratios revealed by this study: 
the merger fraction of the local universe ( >i3.1l and 13. 3p . the 

1 We have labeled this catalogue GZM1 (Galaxy-Zoo Mergers 1). 



Figure 1. Example images of 'strongly-perturbed' systems which 
are not deblended into two photometric objects that plausibly 
represent the photometry of the progenitors. These are usually 
late-stage mergers and cannot be detected by the close-pairs tech- 
nique. Blue circles mark the centre of SDSS photometric objects. 
Red squares mark the centre of SDSS spectral objects. The scale 
bar is 5" for each example image. 



fraction of spectral pairs in merging systems in SDSS (rel- 
evant to discussions for the close-pairs technique; i|3.4[) and 
the spiral-to-elliptical ratio of galaxies th at are in merging 
systems (§33]). In the companion paper, iDarg et"al ] 120091 
(D09b from here on) , we present the environment and inter- 
nal properties of these merging galaxies. 

Galaxy Zoo is a new player in the game of merger- 
location techniques. Before describing our results, we begin 
with a description of current methods used to find mergers 
so that the role and value of Galaxy Zoo can be understood 
in the context of contemporary research. 



1.1 Locating Mergers: Past Methods 

Visual inspection remains the most trustworthy way of de- 
termining whether or not a galaxy is merging. H owever, the 
adven t of large galaxy surveys such as SDSS (|York et al.l 
l2000| j involving ~ 10 6 objects has rendered classification 
by individual researchers impractical. Automated methods 
have therefore been developed to approximate the human 
decision making process but, being only approximations, en- 
counter certain difficulties that Galaxy Zoo can help over- 
come. 



1.1.1 Close-Pair Statistics 

Accurate measures of position and redshift in large scale 
surveys have made straightforward the task of finding close 
pairs in the (local) universe. However, the peculiar veloci- 
ties of the individual galaxies can significantly offset their 
redshift-inferred line-of-sight separation producing spectral 
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pairs of non-interacting systems which may or may not 
merge. One can apply statistical arguments as to what frac- 
tion of close-pairs within some ensemble will soon merge, but 
no single system can be safely assumed to be merging with- 
out cross-examination. The method is best used, therefore, 
to estimate merger rates parametrised by the convention: 
merger rate ~ (1 + z) m . 

Another limitation of close-pairs methods in surveys 
such as SDSS is the requirement that both galaxies have 
spectra. This requires a delicate act of deblending by the 
data-reduction pipeline. SDSS will convert the photome- 
try of an extended body like a galaxy (the 'parent') into 
individual photometric 'objects' (the 'children') depend- 
ing on how many peak s are present in the blended image 
(|Stoughton et all 120021 ). Iterations are performed allotting 
different magnitudes to the child-objects until an optimal 
distribution is attained so that all children sum to give the 
total flux of the parent. 

Any photometric-child object with Petrosian magnitude 
r < 17.77 is designated as a spectral target by the SDSS 
pipeline. It follows that only when a contiguous body, such as 
two interacting galaxies, is deblended into (at least) two pho- 
tometric objects both having r < 17.77 can it contain two 
spectral targets. Therefore mergers with only one deblended 
object of r < 17.77 cannot be identified by the close-pairs 
technique (|Strauss et alj|2002l : IPatton et al j|2008h . This will 
occur for many minor mergers or for late stage mergers 
where the cores have drawn close to each other and the 
pipeline reads them as a single peak. Figure Q] shows exam- 
ples of strongly-perturbed systems with only one spectral 
target. 

Even when two objects are deblended in a contiguous 
image and registered as spectral targets - relatively few sys- 
tems will obtain spectra on both objects due to fiber colli- 
sions within the spectrometer. Two SDSS spectroscopic tar- 
gets cannot acquire spectra simultaneously if they are within 
55" of each other and so only systems contained within 'tile 
overlap regions' can have spectral objects with an angular 
separ ation less than this (jStrauss et alj|2002l . iBlanton et alj 
120031 '). Only about ~ 30% of the SDSS sky rests within over- 
lap regions thus limiting the sample size of such systems. 

Conversely, a system deblended into multiple targets, as 
in Figure [2] can acquire too many spectral objects so that a 
single galaxy might appear as a close-pair or a binary merger 
might appear as a multi-merger. Close-pairs studies typically 
limit the r edshift and magnitude ranges in order to avoid 
this effect. IWoods fc Gellerl (|2007f l. for example, limit their 
close-pairs sample to ~ 0.027 < z < 0.17 and require > 20% 
of the galaxy's total flux land within the fiber aperture in 
order to exclude very extended nearby galaxies. They still 
find though that ~ 15% of their minor-pairs catalogue are 
false pairs after visual inspection. 

To summarise, without visual cross-examination, close- 
pairs methods are prone to include false pairs and non- 
gravitationally bound systems and are best used, therefore, 
to estimate merger rates via application to large ensembles 
after estimations for contamination and incompleteness are 
taken into account. To examine the properties of merger sys- 
tems, one should robustly identify actual mergers as opposed 
to claims that some fraction of an ensemble is 'likely' to be 
merging. 




Figure 2. Example image of binary merger with four spectral ob- 
jects (red squares). It is difficult for 'blind' close-pairs techniques 
to distinguish this from a pair of binary mergers (or a four-way 
merger). 



1.1.2 Automated Quantification of Morphological 
Disturbance 

Pattern recognition techniques applied to galaxy images 
pose a formidable programming challenge. Background noise 
and contaminants first need to be removed, then an auto- 
mated technique needs to quantify how 'disturbed' a mor- 
phology is independent of viewing angle. This might not be 
so difficult if it were not the case that unperturbed galax- 
ies themselves vary so much from the structurally simple 
(ellipticals) to the complex (spirals). In particular, filtering 
asymmetric systems like mergers from spirals with extended 
star-formation requires great finesse. 

Nonetheless, promising automated techniques have 
been developed in recent years that generate parameters re- 
lated to morphology. A common technique has been to mea- 
sure concentration (C), asymmetry (A) and, in some cases, 
clumpiness/smoothness (S). One then partitions CA(S) 
space into morphological categories and identifies an object 
as a merger if i t lie s within the designate d sub-volume (see 
IConselicd 120031 and IConselice et al.ll2003l on the use of the 
'CAS' system to locate mergers). 

Another pair of pa rameters used in thi s fashion are 
the Gini-coefficient (G) (I Abraham et alj|2003h and second- 
order moment of the bri ghtest 20% of the galaxy's light 
(Mao), most notably by lLotz et ahl (|2008al ). (For a con- 
cise des cription of how to calculate C , A, S, G and 
M20 see IConselice. Raigor fc Mversl 120081 .) It is argued in 



lLotz. Primack fc Madau ( 2004 ) that this technique operates 
better at low signal-to-noise ratios and is more sensitive to 
late-stage m orphologies than the CAS system (though see 
iLiskerllioOSl for a critique of its effectiveness). More recent 
stud ies have combined and compared the two techniques 
(e.g. IScarlata et all 120071 ; IConselice. Raigor fc Myers! 120081 : 
lLotz et alfeOOSbl ). Artifical Neural Nets a re another promis- 
ing technique fo r identifying morpholgies l|Lahav et al.lll996l . 
iBall et ahll2004l ) though they have not yet been applied to 
merger studies specifically. 

CAS and GM20 remain effective at high redshifts so 
long as the image quality remains high and, since they are 
automated, can process images quickly once the pipeline has 
been established. However, these techniques also require vi- 
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sual examination to cross-check results and to fine-tune the 

partition boundaries that define a merger. 

Systematic uncertainties usually remain. Ijpgee et al.l 

(|2008l ) recently report that the CAS criterion failed to pick 
up 37 — 58% of their visually classified "strongly disturbed" 
morphologies while including a "significant number of rela- 
tively norm al galaxies" - an eff ec t that was predicted by sim- 
ulatio ns in IConselicel (|2006bT ). IConselice. Rajgor fc Mversl 
(|2008T ) recently classified 993 images from the Hubble-Ultra- 
Deep Field by visual inspection, the CAS volume and the 
GM20 area. They found some notable disagreements, in par- 
ticular, some systems identified as mergers by their location 
in GM20 space are not identified as mergers using the CAS 
system and vice-versa (see for example Figures 8 and 9 plus 
captions). Within the 0.4 < z < 0.8 range only 44 ± 6% 
of the galaxies mapped into the GM20 merger region were 
visually classi fied as 'pecu l iar.' 

Recently lLotz et al] (|2008bh performed an extensive 
study on the merger-detection sensitivity of the CA, GM20 
and close-pairs techniques using simulations. They find that 
C, A, G and M20 methods are only sensitive to mergers at 
specific stages of the process, particularly the first pass and 
final coalescence of the galaxies. The study also confirms 
that the merger time-scales and parameters (such as gas- 
fractions, pericentric distance and relative orientation), for 
which these three techniques remain sensitive, differ signifi- 
cantly. 

Evidently then, much work remains before we can con- 
fidently do away with visual inspection of images in merger 
studies^ 



1.1.3 Visual Inspection by Research Groups 

In the pre-digital age, surveys of morphological classification 
by visual inspe ction were mostly limited to studies based on 
cluster images (|Oemlerlll974l . lDresslen[l98oT ) . With the mod- 
ern development of large surveys and high resolution imag- 
ing, the potential to extract large samples of morphologi- 
cally classified galaxies in differing environments has vastly 
improved. The largest visual clas s ificatio n project by a re- 
search group is ISchawinski et al.1 (|2007aT ) who visually ex- 
amined 48,023 SDSS galaxies in compiling the MOSES cat- 
alogue (Morphologically Selected Ellipticals in SDSS). They 
found a significan t blue-population that had been excluded 
by studies such as iBernardi et al.l (|2003T l which identified el- 
lipticals by their presumed properties. Selecting morpholo- 
gies by a priori assumptions is often pragmatic and neces- 
sary but ultimately begs the question as to what the prop- 
erties of a given morphology actually are. Unless we can be 
certain that a set of properties maps one-to-one with mor- 
phology we must continue with visual examination. 

Studies to select mergers by visual-selection have 
not reached s uch scales as MOSES until this work. 
iLe Fevre et~aH (|2000T l visually examined 285 Hubble images 
and found a merger fra ction of 10 ± 2% over < z < 1.2. 
iNakamura et al.l ( 20031 ) visually classified 2418 images of 
SDSS galaxy objects with Petrosian r < 16.0 (i.e. low red- 
shift) finding that 35/1875 of their morphological classifi- 



cations were Im ('highly irregular' following Hubble's nota- 
tion). A similar study by the same group, iFukugita et al.l 
d20Q7f ), visually classified 2658 images from SDSS with Pet- 
rosian r < 15.9 and found that 1.5% — 1.7% of these nearby 
magnitude limited galaxies were 'interacting.' 

Thus, until now, merger studies have been limited to 
catalogues of ~ 10 3 galaxies due to the time-consuming and 
monotonous nature of the task. By contrast Galaxy Zoo al- 
lows us to acquire effective visual classification for morpholo- 
gies and mergers for samples of ~ 10 5 galaxies with relative 



1.2 Locating Mergers with Galaxy Zoo 

Galaxy Zoo (GZ) is a user interface on the world-wide we 
drawing upon images from SDSS. Volunteers from the pub- 
lic are instructed and commisioned with the collective task 
of visual classification of ~ 900, 000 galaxies from SDSS 
DR6. A complete description of the project in cluding design 
detail s and initial data reduction is given by iLintott et al.l 
(|2008l ). It is also shown that public users are, in large num- 
bers, about as good as 'experts' at identifying morphologies. 

The project has proved to be a tremendous suc- 
cess. To date, over 140,000 volunteers have participated 
and collectively offered classifications for all ~ 900, 000 
images, on average, fifty times over. Data from the 
project has already been employed to find t he statisti- 
cal p r operties of spiral-g alaxy-spin orientation (|Land et al.l 
2008: SI osar et al] 120090 . the relati onships between envi- 
ronment, morpho logy and colour (|Bamford et al] l2008al ; 
ISkibba et al.ll2008h , to stu dy optically blue early-t ype galax- 
ies at very low redshift (|Schawinski et all 120091) and has 
also l ead to some serendipit ous discoveries ( Lintott et al.l 
120091 ; I Cardamone et all I2009T I . This work concentrates on 
the results of the merger-location functionality of Galaxy 
Zoo whose details we now describe. 



2 "Men trust their ears less than their eyes." Herodutus, 
The Histories, 5th c. B.C., 1.8, trans D. Godley. 



2 THE GALAXY ZOO DATA 

2.1 Constructing a Merging- Pairs Catalogue 

Galaxy-Zoo users are asked to classify an SDSS target as 

(i) elliptical (e) 

(ii) spiral (sjj 

(iii) star/bad image (b), or 

(iv) merger (m). 

The Galaxy Zoo project is the first of its kind and so, 
not knowing how well it would be taken up by the public, 
the design of the interface prioritised simplicity over detail. 
A single button labeled 'merger' is all that was offered. 

The raw data we use to build our catalogue is what 
we call the weighted-merger-vote fraction (/m)u GZ has ob- 
tained a value for this parameter for 893, 292 SDSS galaxies 



3 http://www.galaxyzoo.org/ 

4 More specifically, users are asked to specify whether a spiral 
galaxy is rotating clockwise, anti-clockwise or is edge-on/unclear. 

5 Similar parameters are calculated for the other categories such 
that f e + f s + f b + f m = 1. 
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Figure 3. Example images of prospective merger systems. The number given for each image is its weighted-merger- vote fraction, / m . The 
panels of nine images are grouped into the following bins: top-left: < f m ^ 0.05, top-right: 0.05 < f m ^ 0.2; middle-left: 0.2 < / m sC 0.4, 
middle-right: 0.4 < f m sC 0.6; bottom-left: 0.6 < f m SC 0.8, bottom-right: 0.8 < / m 1.0. Our catalogue is made up of mergers with 
/ m > 0.4. Each image tile is 40" X 40". 
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Figure 4. The distribution of weightcd-merger-vote fractions in 
the Galaxy Zoo database for objects with spectra for 0.005 < z < 
0.1. From these we use objects with 0.4 < f m $C 1.0 to construct 
our merging-pairs catalogue (cross-hatched). There are 157,376 
objects with f m = exactly. The numbers on the graph show 
the occupancy of each bin (e.g. there are 109, 433 with < f m ^ 
0.05). In total, 304,182 Galaxy Zoo spectral objects lie between 
0.005 < z < 0.1. 



from DR60 The f m values are calculated by taking the num- 
ber of merger classifications (n m ) for a given GZ object and 
dividing it by the total number of classifications (n e ,s,6,m) for 
that object multiplied by a weighting factor W that mea- 
sures the quality of the particular users that have assessed 
the object. The quality of an individual user is determined 
by measuring to what extent that person agrees with the 
majority opinion for all objects the individual has viewed. 
The weighting factor, W, thereby represents all the itera- 
tions carried out by equations (1) & (2) of iTintott et alj 



fm — 



Wn m 



(i) 



The parameter f m ranges from to 1 so that an object 
with fm~0 should look nothing like a merger and f m = 1 
should look unmistakably so. Figure [3] shows some example 
images taken from GZ labeled by their f m values. It is in- 
teresting to see how low f m is when these images start to 
look, at least superficially, like mergers (/ m ~ 0.2 — 0.4). Ev- 
idently, users were rather conservative in calling something 
a merger. 

Figure [4] shows the distribution of f m for the entire GZ 
catalogue. In building our first merger catalogue we deter- 
mined a cut-off point for f m above which most systems will 
be proper galactic mergers and involve a sample size man- 
ageable by a research team. By comparing the merger-vote 
fraction with images like those in Figure [3] we decided to 
build our first catalogue using only systems with f m > 0.4 



6 This is the same parent population obt ained after six - month s 
of running Galaxy Zoo that is u sed in iLintott et al.l ll 2008t). 
lLand et all 1120081 ). iBamford et alj ll2008al) and ISchawinski et al.l 
J2009D . 

7 W here is not the of lLintott et al.ll2008l . is the weight- 
ing of each individual user whereas W represents their combined 
weighting for each individual GZ object. 



and only using GZ objects with spectra whose spectroscopic 
redshift lies in the range 0.005 < z < 0.1. SDSS spectro- 
scopic targets are selected for galaxies with apparent mag- 
nitude r < 17.77 which corresponds to M r < —20.55 at 
2 = 0.1. This absolute magnitude corresponds to a mini- 
mum stellar mass of ~ 10 10 Mq (see Figure |SJ) so that our 
upper limit (z = 0.1) of our volume-limited sample will be 
inclusive of intermediate size galaxies[f] The resolution of im- 
ages beyond z > 0.1 is rapidly diminishing which would lead 
to unreliable visual classifications of morphology. 

The lower limit z — 0.005 is to minimise the number 
of mergers that go undetected due to an incomplete field 
of view(f| These cases are rare since the number of SDSS 
objects peaks near z = 0.08 and only a few percent have 
z < 0.02. 

After applying the cut to only those objects with spec- 
tral redshifts between 0.005 < z < 0.1, we have 304,182 
objects (see Figure UJ. To find mergers within this set, we 
apply the cut 0.4 < f m ^ 1.0 leaving 4198 GZ objects with 
spectra. We exclude those SDSS targets which are yet to ac- 
quire spectra as we desire accurately measured redshifts F"l 

Although the purpose of the Galaxy Zoo project is to 
significantly reduce the need for research teams to visually 
inspect large catalogues, it is still necessary at this stage 
to double check the results. Visual re-examination of the 
sample by our group allowed us to 

(i) remove any non-merging systems, 

(ii) visually select an appropriate SDSS object to repre- 
sent the merging partner, and 

(iii) assign morphologies to the galaxies in each merging 
system. 

We briefly describe our methods for each of these tasks. 



8 For clarity, the merger catalogue that we construct is not 
volume-limited, i.e. the systems are red-shift limited (0.005 < 
z < 0.1) but the absolute magnitude (M r ) is subject to no for- 
mal constraint in order for that object to belong to the catalogue, 
although its apparent magnitude must have been such that the 
SDSS pipeline deblended the system into a spectral target (with 
r < 17.77). For certain investigations, however, it is important to 
impose an absolute-magnitude cut of M r < —20.55 on the cat- 
alogue in order to ensure completeness across the redshift range 
we are using. We state when we do this and refer to such a subset 
as a 'volume-limited' sample. 

9 This arises because GZ images are scaled for viewing accord- 
ing to the Petrosian radius of the object's model magnitude. The 
photometry of very large and close-by galaxies is often deblended 
into multiple SDSS objects. The more deblended objects there 
are, the less the Petrosian radius of any single deblended object 
represents the galaxy as a whole and this brings about an inap- 
propriately small image-scale for viewing some close-by galaxies. 
Such systems would be viewed by users as nothing but a galaxy 
core and not, consequently, voted as a merger. 

10 Mergers are inherently 'messy' systems and so photometric 
redshifts calculated by comparison to standard templates are 
prone to error. In fact we found the mean absolute difference 
between all the photometric and spectroscopic redshifts in our 
merger sample to be ~ 0.051 - more than half of our redshift 
range! 
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2.2 Removing Non-Merging Systems 

The examples of Figure [3] demonstrate that the weighted- 
merger-vote fraction is strongly correlated with how 
'merger-like' an image appears to be. The outcomes of GZ 
are therefore similar in effect to automated methods like 
CAS and GM20 which also map images to parameter spaces. 
In our case though f m is a single parameter (making it easier 
to divide up 'merger' and 'non-merger' zones) and, by utilis- 
ing the pattern-recognition capacity of many human minds, 
overcomes the need to remove background noise, recognise 
anomalies, etc. We find that for f m > 0.6, all systems are 
robust mergers. However, three causes for mis-classification 
begin to emerge as f m decreases and become common for 
fm < 0.4: 

(i) projection of galaxies along the line of sight, 

(ii) projection of nearby stars onto distant galaxies and 

(iii) cases which are 'border-line' mergers. 

Galactic projections occur when two galaxies have sim- 
ilar celestial coordinates but are separated by a significant 
radial distance. We can easily spot projections when both 
galaxies have spectral redshifts. However, many of our can- 
didate systems have only one redshift. Spotting a projection 
in such cases can only be done through visual examination 
of the image for signs of interaction. Our choice to use only 
systems with f m > 0.4 meant, however, that the need for 
such difficult decisions was rare. 

Stellar projections were the more common problem. We 
were able to eliminate stars easily from our sample though 
by their characteristic point spread function (PSF) using the 
associated parameter 'type' from the SDSS PhotoTag table. 
This is a discrete label given to every photometric object 
in the SDSS database that indicates whether an object is 
'point-like' or 'extended' and can, on this basis, reliably de- 
termine (> 98%) wh ether a luminous object is a star or not 
(|Strauss et al.ll2002f ). 

'Border-line' cases involve galaxies that are morpho- 
logically disturbed but do not necessarily merit the term 
'merger.' All galaxies are merging with something which 
can range from molecules (accretion) to a small galaxy 
(minor merger) to a galaxy roughly its own size (major 
merger). Deciding where in this spectrum of possibilities 
the term 'merger' becomes appropriate is rather subjec- 
tive. At this stage, therefore, our strategy was simply to 
decide by visual inspection whether a given GZ galaxy had 
a 'strongly-perturbed' morphology. By 'strongly-perturbed' 
here we mean that a galaxy was in a morphological state that 
was unlikely to have occurred without some external inter- 
action. Starting with this inclusive criterion allows us obtain 
completeness for anything deserving the term 'merger.' We 
found in fact that borderline cases were again rare for sys- 
tems with f m > 0.4. However, difficult decisions regarding 
projections and border-line cases are abundant for systems 
where 0.05 < f m ^ 0.4 (for example the image of Figure [3] 
with f m = 0.175) and it will therefore require a great deal 
of care if we wish to expand our catalogue into the f m < 0.4 
range in the future. 

We then examine each 'strongly-perturbed' system with 
the aim of selecting an SDSS object to represent the body 
responsible for the disruption caused to the galaxy labeled 
with f m > 0.4 (if any is identifiable). The details of this 




Figure 5. Images exemplifying the construction of the merging- 
pairs catalogue. All spectral targets (red boxes) are Galaxy Zoo 
objects. In the upper panel, we select the two spectral objects 
(1 and 2) from Galaxy Zoo to represent the merging pair. The 
merging system in the lower panel only has one spectral target 
(3) which was found by Galaxy Zoo. We therefore examine all 
neighbouring SDSS objects in order to select one (in this case 4) 
to represent the merging partner galaxy. Our final catalogue has 
3003 such pairs. 
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Figure 6. Example images of multi-merging systems. We gener- 
ally included these in the binary merger catalogue by selecting 
the closest and brightest galaxy to accompany the Galaxy-Zoo 
object. We found 39 systems that could confidently be described 
as multi-mergers. 



procedure are discussed below. With photometric objects 
representing the 'perturber' we can distinguish major merg- 
ers more objectively and their completeness can be assumed 
since they are a subset of 'strongly-perturbed' systems. 

2.3 Visual Selection of Merging Partner 

As mentioned, in order to create a catalogue of merging 
pairs, we needed to manually select an appropriate SDSS 
object as a partner for the object supplied by GZ with 
fm > 0.4. We did this using an IDL routine that allows for 
the rapid examination of the image and photometry of all 
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objects within 30" of the GZ object given by the Neighbors 
table in the SDSS database. We choose whichever object ap- 
pears to be the most 'plausible' representation of the body 
responsible for the morphological disruption to the galaxy 
represented by the GZ object. Plausibility was judged on 
the basis of object brightness (with brighter objects in the 
r-band being preferable) and visual 'common-sense.' See 
Figure [5] for examples of this procedure. For most systems, 
which are binary mergers, this choice was straightforward 
as they usually have either 

(a) spectra centred on both galaxies or 

(b) a spectral object centred on one galaxy and only 
photometry on the other. 

However, not all 'strongly-perturbed' systems ap- 
pear as simple binary mergers. Additionally, there are cases 
where 

(c) galaxies are in the final stages of a merger and its 
progenitors are no longer distinguished by the SDSS 
pipeline ('post-mergers'), 

(d) a galaxy has been perturbed by a close encounter with 
a neighbour no longer in view ('fly-by') and, occasionally, 

(e) a merging system involves three or more galaxies. 

The wide range of possibilities makes merger taxon- 
omy a difficult task. In constructing a merging-pairs 
catalogue we therefore proceed as follows for these various 
cases. 

For case (a) we usually choose these two spectral objects 
to represent our merging pair (see figure |SJ) . If the merger 
companion does not have spectra then we visually select the 
best photometric object available to represent the merging 
partner (case b). 

Cases (c) and (d) are sometimes difficult to distinguish 
and usually occur when only one galaxy core is apparent 
with the peripheries undergoing extensive tidal disruption. 
This usually means that no photometric object is available 
to plausibly represent the perturbing body and so, in such 
cases, we simply decline to select a merging partner. They 
remain in the category of 'strongly-perturbed' systems (and 
are included in our calculation of the merger fraction; see 
tj3.ll and H3.3P but are not included in the merging-pairs 
catalogue. Figure [1] shows examples of these two categories. 

In the case of (e), where several galaxies are merging at 
once, we decided to first note them and then include them 
in the merging-pairs catalogue by selecting the closest and 
brightest object to the GZ-supplied galaxy. Figure [S] shows 
examples of such systems. The catalogue is technically a mix 
between 'binary-mergers' and 'multi-mergers' and so we call 
it the 'merging-pairs' catalogue since it contains 3003 pairs 
of galaxy objects which are all in merging systems. 

To summarise, through this refining and pairing process 
we converted 4198 objects to 681 pairs where both objects 
have spectra (case a) and 2322 pairs where only one ob- 
ject has spectra (case b). These make the 3003 pairs of the 
catalogue. 370 of the 4198 systems were considered strongly- 
perturbed but unsuitable to be put into a pair (cases c and 
d). Only 39 of the 3003 pairs are confirmed to be multi- 



mergers (case e). The remaining 144 object^]] were dis- 
carded because they were deemed to be in non-strongly- 
perturbed systems (mainly stellar overlaps). 

2.4 Assigning Morphologies 

Morphologies were assigned to each SDSS object in our 
merging-pairs catalogue by a single classifier (DWD) work- 
ing with consultation. We use four classifications for the 
merging galaxies: E, S, EU and SU. The E and S classifica- 
tions respectively label those galaxies which are clearly ellip- 
tical and spiral by morphology. No appeal to colour should 
be necessary. The EU and SU are those ellipticals and spirals 
about which we are 'unsure,' in other words, this is our best 
guess0 The 'unsure' morphologies are usually more distant 
objects whose image resolution is too poor to distinguish 
features like spiral arms. Choosing between EU and SU can 
be very difficult and is based mostly on apparent surface- 
brightness profile and, in very difficult cases, on colour. (See 
Figure [7] for examples and further details of our decision- 
making criteria.) 

A simple means to select morphology in SDSS is via 
the SDSS fracdev parameter measured in the r-band that 
ranges from to 1. This is a measure of the goodness-of-fit 
of a galaxy's surface-brightness to a de Vaucouleur profile. 
Ellipticals tend to have a fracdev ~ 1 whereas spirals have a 
wide distribution. We find that our morphological categories 
S, SU, EU and E have mean fracdev values of 0.48, 0.85, 0.93 
and 0.94 respectively fitting qualitatively with expectation. 
The high value of 0.85 for the SU category suggests there is 
contamination by bulge dominated discs or ellipticals as dis- 
cussed in £13.51 This is expected si nce distant, poorly-reso lved 
spirals can look like ellipticals (jBamford et all l2008al ). To 
avoid such contamination we can restrict ourselves at any- 
time to using only the 'sure' morphologies. 



3 THE MERGER FRACTION OF THE LOCAL 
UNIVERSE 

Our large merging-pairs catalogue and f m values given by 
GZ for 893, 292 spectral objects enables us to address two 
distinct and important questions which are highly relevant 
to the field of galaxy evolution: 

(i) what is the fraction of merging galaxies in the local 
universe? and 

(ii) what is the ratio of spirals to ellipticals (N 3 /N e ) in 
mergers in the local universe? 

3.1 Estimating the Merger Fraction (I) 

Our merger location technique is different to those before it 
and is suited therefore to a different procedure for finding 
a meaningful 'merger fraction' which we make explicit here. 
First, we must find the fraction of volume-limited galax- 
ies in the local universe currently in a 'strongly-perturbed' 
state. To accomplish this we use the subset of our 893,292 

11 4198 - 3003 - 681 - 370 = 144. 

12 We refer to the set of merging galaxies with elliptical classifi- 
cations as 'E+EU' galaxies, etc. 
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Figure 7. Example images of galaxies for morphological cate- 
gories E, S, EU and SU. Any combination of these is possible. 
Poorer image resolution as z — > 0.1 affects certitude of morpho- 
logical components. The bottom panels show examples of 'unsure' 
morpholgies. If a morphologly is unclear, it will normally be as- 
signed EU type by surface brightness profile if there is a drop off 
in brightness from the centre. SU types generally have a more 
uniform surface brightness profile in our approach. The colour of 
the RBG images was also used as a visual guide in very difficult 
cases under the assumption that EU types should be more red. 
Image widths: E-E ~ 120", S-S ~ 240", EU-EU ~ 40" & SU-SU 
~ 30". 



GZ spectral objects that are members of the Main-Galaxy- 
Spectral sample (MGS; IStrauss et all I2002T ) in SDSS and 
estimate what fraction of them are 'strongly-perturbed.' By 
using only objects with spectra we are able to volume-limit 
the sample (by the usual M r < —20.55 constraint) which is 
necessary for a meaningful merger fraction. 



E 



fmgs 



'Strongly-perturbed' volume- limited 
MGS spectral objects in SDSS 



E 



All volume-limited 
MGS spectral objects in SDSS 



(2) 



The use of only those spectral objects which are in the 
MGS leads to a good approximation for the real merger frac- 
tion, f rea i, so long as an appropriate factor C is applied to 
correct for spectroscopic incompleteness, i.e. / rea i = Cf mg s 
where we estimate C ~ 1.5. Justification for this claim with 
a brief discussion of the MGS and spectroscopic targeting in 
SDSS is given in Appendix A. 

The merging-pairs catalogue produced in 32]is large but 
incomplete since it is constructed only from systems with 
fm > 0.4. Finding fmgs therefore requires extrapolation into 
the < fm < 0.4 region. To find the numerator of ©, we 
therefore took volume-limited samples of 100 GZ-spectral 
objects from the bins < f m < 0.05, 0.05 < f m < 0.20 
and 0.20 < f m < 0.40 which are also in the MGS. By vi- 
sually inspecting each set of 100 galaxies twice (the first 
time round we made our decisions very conservatively, the 



second time round very liberally) we obtained estimates for 
the percentages of 'strongly-perturbed' galaxies within these 
bins. These were: 0-2%, 18 - 34% and 50 - 59% for the 
0.0 < f m < 0.05,0.05 < fm < 0.20 and 0.20 < f m < 0.40 
bins respectivelyLJ There are, of course, many more objects 
in these bins than in those with f m > 0.4 (see Figure[4| and 
so end up contributing to at least two thirds of all 'strongly- 
perturbed' systems. We assumed that no GZ object with 
fm = would be 'strongly-perturbed.' 

Applying these estimated fractions to the total num- 
ber of MGS objects in each bin and adding the 'strongly- 
perturbed' and volume-limited MGS objects from the 4198 
GZ objects with f m > 0.4 gives us the numerator for equa- 
tion The denominator of equation ((2| is found to be 
157,801. Dividing these gives the fraction fmgs = 4 — 6%. 
We sum up our result for f rea i in the following way: 

The Merger Fraction (Weak Statement) 

~ 4 — 6 X C% of all galaxies in a volume-limited sample (M r < 
—20.55) in the local universe (0.005 < z < 0.1) are 'strongly- 
perturbed.' 

We call this the weak statement because of the subjec- 
tive nature of deciding whether or not a system is 'strongly- 
perturbed' or not. The error in this percentage arises from 
the upper and lower bounds estimated for the three sam- 
ples of 100 images (before the correction factor C ~ 1.5 is 
applied). In i ]3.3l we use the 3003 systems from the merging- 
pairs catalogue (plus a few more plausible assumptions) to 
offer a stronger statement giving the fraction of major merg- 
ers in the local universe. 



3.2 The Stellar Masses of Merging Galaxies 

Of the 'strongly-perturbed' systems, we need to find what 
subset comprise major mergers. For this study, we define a 
major merger to be two merging galaxies of stellar masses 
M* and M2 where 1/3 < M'/MJ < 3 and so, to estimate the 
subset of major mergers, we need to calculate stellar masses 
for the galaxies in the merging-pairs catalogue. 

We do this by fitting the SDSS photometry for each ob- 
ject in our catalogue to a library of photometries produced 
by a variety of two-component star formation hi s tories. The 
approach is similar to that of ISchawinski et alj (|2007al ) ex- 
cept that we do not use the information contained in the 



13 We also double checked our results with an additional interface 
on the world-wide web designed to re-examine all objects within 
the range 0.2 < fm < 0.4. After a few months, enough users 
had re-classified these images of interest in order that a new- 
wcightcd-merger-vote fraction, f m , could be calculated for each 
image based upon clicks of the 'merger,' 'not-merger' and 'don't 
know' buttons. By then volume-limiting all spectral objects from 
this sample as before with 0.005 < z < 0.1 and M r < —20.55 
we are able to filter out more actual 'strongly-perturbed' sys- 
tems. However, we again needed to decide a cutoff for the vote 
fraction, f' m . We examined twenty sets of ten images across the 
entire f' m range and estimated how many of these were genuine 
'strongly-perturbed' systems. We found that > 80% of images 
were a ctual 'strongly-pertu rbed' systems for f' m > 0.6 and, fol- 
lowing [Lirtott^et^al] q2008f ) . made this number the cut off. The 
fraction of objects with f' m > 0.6 is ~ 53% which is in good agree- 
ment with our estimate that 50 — 59% of objects in this range are 
'strongly perturbed.' 
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Figure 8. Relation between stellar mass and absolute magnitude 
in the r-band. The solid horizontal line at M r = —20.55 corre- 
sponds to the cut used to obtain our volume-limited samples. This 
cut removes virtually all galaxies with mass M < 10 10 Mq as well 
as galaxies up to a mass of M ~ 7 X 10 10 Mq. 



stellar absorption indices. The library SEDs are generated 
using the Maraston (1998, 2005) stellar models. Both com- 
ponents have stellar populations with variable age with fixe d 
solar metallicity and Salpeter IMF (jScarlata et al.|[l955h . 
The first (older) burst is a simple stellar population (SSP), 
the second (more recent) burst is modeled by an exponen- 
tial with variable e-folding time. The purpose of the varying 
e-folding times is to account for galaxies with extended star 
formation histories. This is especially important for merg- 
ers which are likely to have undergone recent star-formation 
episodes. Dust is implemented using a Calzetti et al. law 
(|Calzettill2OO0h that is free to vary for E(B-V) over to 0.6. 
It should be noted that mergers, by their very nature, are 
prone to mix and overlap and this could lead to additional 
reddening. 

It is important that our cut for the merger fraction be 
based upon magnitude and not mass since it is the pho- 
tometric brightness of the object that determines whether 
or not it will end up in the MGS (which is integral to our 
defmtion of the merger fraction) . In Figure [8] we examine 
the effect of imposing our volume-limiting cut on the mass 
distributions. Mass scales with brightness though the scatter 
is substantial such that the magnitude limit M r < —20.55 
removes all galaxies with < 10 10 Mq and some as massive 
as ~ 7 x 10 10 Mq. It is therefore undesirable to estimate the 
merger fraction using a mass cut (instead of a magnitude 
cut) for this redshift range since one would need to cut at 
no less than ~ 7 x 10 10 Mq to ensure completeness and this 
would greatly reduce the sample size. It would also make a 
calculation of the merger fraction impractical since we would 
then need masses calculated for the entire MGS sample to 
get the denominator of equation ([2]) and this would be an 
enormous computational task. In D09b we examine the dis- 
tributions of these stellar masses in more detail and in com- 
parison to a control sample. 



3.3 Estimating the Merger Fraction (II) 
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Figure 9. This figure illustrates the occupancy of regions in 
mass- mass space for all the merging-pairs where the more massive 
galaxy has M r < —20.55. Points between the 1 : 1 and 1 : 3 lines 
are major mergers. We only assume completeness within this strip 
in this study. The boxes in the upper panel divide the sample by 
morphology. 



We now apply the information obtained in £13. II (the frac- 
tion of 'strongly-perturbed' galaxies) with that obtained in 



i|3.2l (stellar-mass estimates for objects in the merging-pairs 
catalogue) in order to estimate the fraction of major merg- 
ers in a volume-limited sample (M r < —20.55) in the local 
universe (0.005 < z < 0.1). 

Figure [9] illustrates the occupation of the mass space of 
the merging pairs in our catalogue with the magnitude limit 
imposed on the more massive galaxy in the pairH This cri- 
terion allows us to view both major and minor mergers. We 
find that ~ 50% of these points lie within the major-merger 
strip. Estimation of the number of minor mergers within 
our volume-limited ranges is difficult since their complete- 
ness rapidly diminishes as M1/M2 increases F^l 



14 As a reminder, the catalogue of 3003 merging pairs is not 
volume-limited, i.e. there is no formal constraint for either object 
in the pair to have a minimum absolute brightness. This means 
that in order to get a sample that is complete over the redshift 
range in use, we need to impose an absolute magnitude cut to all 
systems therein (M r < —20.55 since, at z = 0.1, this will select 
all galaxies with r < 17.77, the minimum brightness needed to 
be designated as a spectral target). However, for mergers where 
we have two objects, one can choose to impose this absolute- 
magnitude constraint either on both objects in the pair or only 
on the largest/brightest object. We choose the latter option for 
this particular graph since it is more inclusive of minor-mergers. 
5 Mergers between systems with /M? > 1000 arc of course 
abundant in the universe (every galaxy is merging with something 
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However, we can plausibly assume completeness for ma- 
jor mergers since, by the nature of the images they produce, 
they are easy to spot. Knowledge of the stellar-masses thus 
allows us to estimate what subset of the 'strongly-perturbed' 
MGS spectral objects in the local universe which we found 
in A3. II are major mergers by the technical definition of 
1/3 < Ml/M* 2 < 30 

To get a merger fraction, we again need to apply our 
analysis soley to volume-limited objects within the MGS (3 
Of the 3864 spectral objects in our 3003 binary-merger pairs, 
2306 have M r < —20.55. Of these, 1243 are in major merg- 
ers leaving 1063 in minor mergers. We therefore find that 
1243/2306 ~ 54% of the volume-limited objects with spectra 
taken from our catalogue (which all have f m > 0.4 according 
to how they were selected) are in major mergers. 

We can now estimate an upper-limit for the fraction of 
major mergers in the local universe by supposing that this 
fraction of major mergers (~ 54%) from what was origi- 
nally classified as a set of 'strongly-perturbed' systems will 
be the same for all strongly-perturbed systems in the range 
f m < 0.40 This is certain to be an overestimate since there 
are bound to be a higher portion of major mergers in systems 
with high f m . The reason for this is that most 'strongly- 
perturbed' systems obtain a low f m ~ 0.2 precisely because 
they are mostly 'border-line' cases (i.e. perturbations caused 
by interactions or very minor mergers). Figure flOl confirms 
this expected relationship over the range of f m for our cat- 
alogue, that is, the ratio of major-to-minor mergers in our 
sample is seen to increase with f m . In other words, users 
tended to more readily spot mergers involving galaxies of 
roughly equal mass. Applying this fraction of 54% to the 
upper limit of our fmgs gives 0.54 x 6% — 3.24% which, be- 
ing an overestimate, we can plausibly round down to ~ 3%. 

We can also obtain a lower-limit to the major-merger 
fraction by supposing that the major mergers in our cata- 
logue comprise all of the major mergers in the local universe. 
That is, we can suppose that there are no major mergers at 
all in the range f m < 0.4 for our volume-limited sample. 
This is of course an underestimate since there are bound to 
be at least some major mergers in, for example, the range 
0.2 < f m < 0.4. Applying this assumption gives the lower 
limit of 1243/157, 801 ~ 0.8%E3 Again, since this is un- 
doubtedly an underestimate, we can plausibly round this 
limit up to ~ 1%. We summarise this working in the follow- 
ing way. 



small) but will not, by their very nature, get spotted in a merger 
study. We would therefore not be able to derive the abundance of 
minor-mergers in the local universe here with much confidence. 

16 As opposed to a purely subjective decision based upon visual 
inspection of the image. 

17 Since we use the MGS to establish the denominator of equation 
©. 

18 Recall that, when we estimated the fraction of 'strongly- 
perturbed' systems as expressed in the Weak Statement, we 
needed to estimate how many MGS objects with f m < 0.4 would 
be classified as 'strongly-perturbed.' Some of these will be major 
mergers, the rest minor. By assuming that the fraction of major 
mergers in our catalogue (with f m > 0.4) is the same for all the 
rest (f m < 0.4), we obtain an upper limit for how many major 
mergers are in the local universe.) 

19 Where 157,801 is the denominator of J2T . 
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Figure 10. The relationship of major-to-minor-mergers ratio for 
the merging-pairs catalogue over the 0.4 < f m < 1.0 range. The 
trend indicates that major mergers are more likely to get merger 
votes than minor mergers. Minor mergers only outnumber major 
mergers in the 0.4 < f m < 0.5 bin. The effect is similar for both 
volume- and non-volume limited samples. 



The Merger Fraction (Strong Statement) 

~ 1 — 3 X C% of all galaxies in a volume-limited sample (M r < 
—20.55) in the local universe (0.005 < z < 0.1) are observed in a 
major merger [3 

The large error obtained here arises due to the simplic- 
ity of the Galaxy Zoo interface. The recently released Galaxy 
Zoo Two project, which focuses on more specific questions 
(e.g. is the system 'merging' or 'interacting') and removes 
the dichotomy of describing either the morphology or the 
merger-likeness of the system, should lead to a much more 
exact figure for the merger fraction in the near future. 



3.4 Close-Pairs Comparison 

Comparing the percentage in this strong statement with 
close-pairs studies is difficult because few of them present 
their results in terms of merger fractions. They also use 
different criteria for their volume- limiting bounds and for 
accounting for errors which are important in such studies 
(such as fiber collisions, fals e-pairs and n o n-gra vitationally 
bound pairs) . For example, iPatton et al.l (|2000l ) concluded 
that ~ 1.1% of nearby galaxies with —21 < Mb < —18 are 
undergoing a merger (but not necessarily a major merger by 
our definition). Our percentage is similar but the definitions 
are so diverse as to render co mparison obs o lete. 

A more recent study by IPatton et ail (|200St ) on SDSS 
close-pairs measures the number of 'close-companions' per 
spectral object which satisfy three constraints: physical 
separation (r p ; 5 < r p < 20h~ 1 ), rest-frame veloc- 
ity (Av < 500kms _1 ) and absolute magnitude difference 
(|AM r ^ 0.753|). Again though, there are significant differ- 
ences between their samples and ours. Their magnitude lim- 
its are —22 < M r < — 18 and they define a major merger 
by luminosity (at 1:2), not by mass as we do (at 1:3). More 



20 Where we use the same corrective factor C ~ 1.5 (Appendix 
A). 
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Weighted-Merger-Vote Fraction 

Figure 11. The distribution of volume-limited close-pairs objects 
in the MGS comparing f m and their projected separations. The 
vertical broken line marks the boundary for our merger catalogue 
at f m = 0.4. The solid diagonal line is a best-fit to the distribution 
of points in this / m -separation space. The shading indicates the 
density of points in various regions of this space. GZ users were 
more likely to call a close-pairs object a merger as the projected 
core-separation decreased. 



significantly, only ~ half of the pairs satisfying their cri- 
teria are "known to exhibit morphological signs of interac- 
tions" whereas our sample is selected on the basis of visually- 
established interactions. So although the number of close- 
companions they calculate (N c ~ 0.02) is similar to our 
major-merger fraction (~ 1— 3xC%) the differences between 
the two techniques make claims of corroboration difficult to 
substantiate. 

We performed our own comparison of GZ with a close- 
pairs catalogue of all SDSS spectral objects within a pro- 
jected separation of 30kpc of each other and a line-of-sight 
velocity difference of < 500kms -1 . For a consistent compar- 
ison with our catalogued mergers, we examine only those 
objects which are within the volume-limited boundaries 
(0.005 < z < 0.1; M r < -20.55) and the MGS. This gives 
2308 individual close-pair objects. Many of these will be part 
of a false-pair arising from the automated deblending of a 
single galaxy into two or more spectral targets (see ijl.l.ip . 
Some pairs will also appear well separated (relative to their 
size) and show no signs of interaction. We therefore visu- 
ally examined all 2308 objects in order to determine which 
ones are in a 'strongly-perturbed' state brought about by 
the galaxy represented by the other spectral object in the 
close pair. This led to the removal of 654 (~ 28%) of the 
close-pairs objects in our volume-limited MGS sample. 

The 1654 remaining spectral objects are all GZ objects 
and so we can examine how users voted for them. We found 
that a significant portion (~ 64%) of these close-pair spec- 
tral objects which are in 'strongly-perturbed' systems (by 
our reckoning) had f m < 0.4 and were therefore excluded 
from our catalogue. However, we also found that there is 
a strong correlation between f m for close-pairs objects and 
their projected separation such that the further apart close- 
pair objects are, the less likely users were to label it a merger 
(see Figure [TT}. This means that the GZ technique selects 
systems which are generally more advanced in the merger 
process and therefore undergoing more extensive interac- 
tions than a typical system in close-pairs studies. Only ~ 600 



of these objects are in our catalogue of ~ 3000 pairs which 
fits with the claim that only ~ 20% of (relatively advanced) 
merging systems are spectral pairs in SDSS, the rest be- 
ing systems with only a single spectral object within our 
magnitude-limited volume. 

We conclude that, like the CAS and GM20 techniques, 
the GZ method for detecting mergers is sensitive to different 
stages of a merge r compared to the close-pairs technique 
|Lotz et alj|2008bft . 

3.5 Estimating the Spiral to Elliptical Ratio in 
Merging Systems 

3.5.1 Uncertainties and User Bias 

Having established an estimate of what fraction of galax- 
ies in the local universe are merging, we now turn to the 
more difficult task of estimating the spiral-to-elliptical ra- 
tio of galaxies that are merging In addition to 
the uncertainties associated with identifying actual mergers, 
there are now the uncertainties associated with varying im- 
age resolu tion leading to ' sure' a nd 'unsure' morphologies 
(see H2.4J1 . IBamford et al.l (|2008aT ) have studied extensively 
the dependence of GZ morphological classifications on red- 
shift and found that the proportion of elliptical classifica- 
tions increases with 2. The same problem arose for our clas- 
sifications: over the range 0.005 < z < 0.03 the fraction of 
'unsure' morphologies (EU, SU) to 'sure' morphologies (E, 
S) is ~ 10% but this fraction rises to ~ 25% as we vary over 
0.03 — > 0.1. In short, the more distant a galaxy is the more 
'featureless' it appears bringing about a visual misclassifi- 
cation that inflates the recorded number of ellipticals. For 
details of the quantific ation and correction to this effect, see 
IBamford etail (|2008al ). 

A further unknown peculiar to GZ is mass-user psy- 
chology. A natural concern of ours was that users would be 
more likely to call a system a 'merger' if it involves two spi- 
rals (which generally look more dramatic) than if it involves 
two ellipticals. We therefore examine how the ratio of spi- 
rals to ellipticals in our merging-pairs catalogue varies with 
fm as shown in Figure 1121 Whether we use just the 'sure' 
morphologies (E+S sets) or whether we include the 'unsure' 
morphologies (E+S+EU+SU sets), the N 3 /N e ratios appear 
to follow curves that start high for f m ~ TO and decay to- 
wards a constant as f m — > 0.4. This confirmed the suspicion 
that the systems compelling users to click the merger but- 
ton the most tend to involve spirals. However, the majority 
of mergers are located in the bins where the ratios level off 
at roughly a constant (~ 0.4 < f m < 0.6). Inclusion of the 
EU and SU categories slightly decreases N 3 /N e as expected 
(some 'unsures' are really spirals but get mistaken for ellip- 
ticals) . 

The horizontal lines of Figure [TJ] indicate the mean 
N s /Ne ratios for the E+S and E+S+EU+SU samples over 
the whole range 0.4 < f m < TO. These N s /N e means are 
~ 5 and ~ 3 respectively. The true mean over this range 
must surely be N 3 /N e > 3 therefore since the 'unsure' mor- 
phological categories deflate N 3 /N e through false inclusion 
of ellipticals as discussed earlier. 

21 Where N 3 and N e are the number of galaxies in a given sample 
of mergers that are spiral and elliptical respectively. 



Galaxy Zoo: the properties of merging galaxies (I) 13 



30 



.9 20 



jlt-fVbl. Limited) E+SiGalaxies 
o(Vbl. Limited) E+S+EU+SU 



Galaxies 



0.4 



0,5 0,6 0,7 0-8 0,9 

Weighted-Merger-Vote Fraction (f m ) 



1.0 



Figure 12. An examination of the relation between morphology 
and fm- We magnitude limit our sample (M r < —20.55) and 
measure the ratios of spiral to elliptical morphologies (N s /N e ) of 
mergers for bins shown along the f m axis. As / m — > 1.0 a strong 
bias is seen for users to flag mergers involving spirals. N s /N e 
appears to converge to roughly a constant as / m — » 0.4. The 
broken horizontal lines are the mean N s /N e values for the two 
samples over the whole range 0.4 < f m $J 1.0. 



3.5.2 Towards an N„/N e Estimate for Merging Galaxies 

We proceed now by extrapolating this value N 3 /N e > 3 
(found for the range f m > 0.4) to the entire range of f m , 
that is, we assume N s /N e > 3 for all galaxies in mergers for 
our volume-limited ranges. We compare this ratio with the 
global spiral-to-elliptical ratio for all galaxies in our redshift 
range determined using the corrections of iBamford et al.l 
|2008al ) to debias the high occurrence of ellipticals with in- 
creasing redshift. This debiasing leads to the estimate that 
there are ~ 3 : 2 spirals to ellipticals for all galaxies with 
M r < —20.55 in our redshift range. Our extrapolated esti- 
mate of N a /Ne > 3 for f m > 0.4 is significantly higher in 
comparison, i.e. our extrapolation would suggest that spirals 
feature in mergers roughly twice as often as they should if 
selected randomly from the global population. 

To test the accuracy of this extrapolation we examined 
the morphologies of the same 59 systems from the 100 im- 
ages of 0.2 < f m ^ 0.4 (used in H3.ip that were deemed 
to be 'strongly-perturbed.' We found that 42/59 of these 
'strongly-perturbed' galaxies were either S or SU and the 
remaining E or EU. Taking N 3 /N e ~ 1.5 for the global es- 
timate to give null binomial probabilities p(spiral) = 0.6 
and p(elliptical) = 0.4, the expected outcome of 59 galaxy 
morphologies would be 35.4 ± a spirals with standard de- 
viation a ~ 3.8. The observed number, 42, is just within 
two standard deviations of the expected value. This obser- 
vation therefore supports the claim that, even in the range 
0.2 < f m < 0.4, N s /N e in mergers is higher than the global 
mean. This also does not take into account the fact that our 
observations are still biased by inclusion of 'unsure' mor- 
phologies (which inflates the number of ellipticals) whereas 
the estimate N s /N e ~ 1.5 for the global population has been 
debiased. 

To summarise, we find that N 3 /N e > 3 over the range 
of our merging-pairs catalogue (/ m > 0.4). This is at least 
twice the global ratio (N s /N e ~ 1.5). There is no evidence 



to suggest that mergers in the range f m < 0.4 will compen- 
sate this effect with an excess of ellipticals realtive to their 
global population. The high N s /N e ratio is especially likely 
to stand up to scrutiny for major-mergers since they have 
been shown to favour higher values of f m (see Figure 1 10p 
where the spiral excess has been robustly confirmed. 



3.5.3 Implications of a High N a /N e Ratio in Mergers 

We now discuss possible reasons for a high N a /N e ratio in 
mergers. It is well established that spirals tend to occupy 
les s-dense envir onments than ellipticals (verifie d as early 



__ Oemler|[l974l and as recently as, for example, iBall et al.l 
120081 ). The discrepancy might therefore be the result of a 
preference for mergers to occur in field environments where 
spirals are more populous relative to the global population. 
Alternatively though, the disproportionate number of spirals 
in our 'snapshot' of the local universe could indicate that the 
time-scales over which spiral galaxies remain detectable in 
mergers exceeds that of ellipticals. 

Studies have been carried out to estimate the time- 
scales of mergers using dynamical-friction arguments (e.g . 



Conselice l2006al) and simulations (e.g. iBell et alj 120061 ; 
Conselice|[2006al ; lLotz et alj|2008bl ). lLotz et al.l (|2008bl ). in 



particular, focus on this question and find that the gas- 
fraction of galaxies in mergers is one of several factors 
determining the time-scale of detectability. One such sim- 
ulation comparing two equal-mass mergers with different 
gas-fractions showed that the system with the higher gas- 
fraction (fgas ~ 50%) remained 'morphologically disturbed' 
(i.e. flagged as a merger by the standard criteria of G, M20, 
C, A combinations) for 2 — 4 times longer than the system 
with a lower gas- fraction (f ga s ~ 20%). 

In practise, any variation in N s /N e for merging galax- 
ies will depend on a combination of these two explanations 
since, a priori, it seems certain that the merger frequency 
will have some dependence on environment and that the 
merger-time-scale of detectability will have some depen- 
dence on the internal characteristics that distinguish spirals 
from ellipticals (such as gas-content and overall mass). We 
examine these two effects in more detail in the companion 
paper D09b and conclude that, in fact, the latter explanation 
seems to be the most likely, that is, that mergers involving 
spirals remain detectable for longer times than mergers with 
involving ellipticals. 



4 SUMMARY AND DISCUSSION 

Galaxy Zoo is a new and powerful strategy for locating 
mergers. The technique is similar in its effect to the CAS 
and GM20 methods in that it converts images to numbers 
that provide a measure of how 'merger-like' a galaxy is (in 
this case the weighted-merger- vote fraction f m G [0, 1]). The 
method is highly apt at locating mergers because / m is the 
averaged product of human minds (which are highly adept at 
pattern recognition) and is therefore extremely sensitive to 
details while doing away with the major programming chal- 
lenges associated with automated methods. It is also easier 
to partition merger-spaces in this method since / m is a sin- 
gle parameter unlike CAS and GM20 which demand more 
fine-tuning. The technique is also not limited to objects with 
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spectra as the close-pairs method is (by definition) and we 
find in fact that only ~ 20% of our merging systems have 
spectra on both galaxies. This is mostly due to the large 
fraction of the SPSS sky th a t suffers from fiber- collisions 
(~ 70%; IStrauss et all |2002| . iBlanton et ail 120031 ). It also 
means that the method is effective at the broader task of 
finding 'strongly-perturbed' systems including minor merg- 
ers. The drawbacks to GZ are time and repeatability. The 
results presented in this paper are derived from about six 
months of web activity. 

It is worth emphasising that the results of this paper are 
entirely derived from the pressing of a single button labeled 
'merger.' The simplicity of this interface lead to an unfortu- 
nate dichotomy whereby users were often unsure whether to 
emphasise the merger aspect or the morphological aspect of 
a given system. Now that the competence and eagerness of 
the public to assist in extragalactic astronomy is known and 
tested, we expect greatly improved results with the Galaxy 
Zoo Two project - a new development that will enable finer 
classification of SDSS objects as well as higher redshift sur- 
veys. 

Galaxy Zoo has already yielded rich results with the 
initial merger catalogue presented here containing 3003 
merging-pairs in mergers in the range 0.005 < z < 0.1 cre- 
ated from GZ objects with f m > 0.4. Each has been been 
visually-inspected by fifty or so Galaxy-Zoo users and by one 
of the authors (DWD). We believe that it is the largest of its 
kind. Completeness, however, remains an issue as users and 
experts are only as accurate as the image quality allows. As 
redshift increases, the reduced image quality makes it diffi- 
cult to identify galactic projections. Also, there will always 
be the problem for any merger-detection method in deciding 
whether or not a galaxy perturbation (such as an extended 
tidal tail) is great enough to warrant the term 'merger.' For 
these two reasons, there is a large number of 'border-line' 
cases in the range < f m < 0.4 which are responsible for 
the error in our estimate of the merger fraction in the local 
universe. 

To obtain this merger fraction, we first estimated the 
number of 'strongly-perturbed' galaxies with spectra in bins 
over the range < f m < 0.4 plus those in our catalogue. 
Dividing this number by the total number of volume-limited 
spectral objects lead us to estimate that f mg s ~ 4 — 6%. 
With the correction factor C ~ 1.5 converting fmgs to f re ai 
estimated in Appendix A, one can say that ~ 4—6 x C% of all 
volume-limited galaxies (0.005 < z < 0.1 and M r < -20.55) 
are 'strongly-perturbed.' We expanded upon this statement 
by estimating what subset of 'strongly-perturbed' volume- 
limited spectral galaxy objects are major mergers in the 
local universe. This led to the stronger statement that: 

~ 1 — 3 X C% of volume-limited galaxies in the local universe 
(M r < —20.55, 0.005 < f m < 0.1) are observed in major mergers. 

We also estimated the ratio of spirals to ellipticals for 
merging galaxies with M r < —20.55. The N„/N e ratio for 
our catalogue was highly in favour of spirals by ~ 3 : 1. 
We examined 100 systems in the range 0.2 < f m < 0.4 and 
found a similar result. This is large compared to the global 
N 3 /Ne ratio (~ 3 : 2) for this magnitude- limited volume. 

We therefore concluded that more spirals are seen to be 
merging than ellipticals, perhaps by as much as factor ~ 2. 
We then discussed possible reasons for this observed spiral 



excess suggesting that either (1) mergers tend to occupy 
environments that favour spirals, or (2) the time-scale for a 
merger to reach a relaxed state (i.e. the time over which it 
is detectable) is longer for spirals than for ellipticals I 22 ! 

We disentangle these effects in the companion paper 
D09b by exploring the role of the environment and the in- 
ternal properties of this merger sample. We find in fact that 
mergers occupy the same (if not slightly denser) environ- 
ments as a randomly selected control sample of galaxies. We 
conclude there that the best explanation of an apparently 
high spiral-to-elliptical ratio in mergers, as we find here, 
must be due to varying time scales of detectability. The 
properties of spirals and ellipticals that affect merger de- 
tactability are therefore an important issue that the Galaxy 
Zoo catalogue can help us understand. 



5 ACKNOWLEDGMENTS 

D. W. Darg acknowledges funding from the John Templeton 
Foundation. S. Kaviraj acknowledges a Research Fellowship 
from the Royal Commission for the Exhibition of 1851 (from 
Oct 2008), a Leverhulme Early-Career Fellowship (till Oct 
2008), a Senior Research Fellowship from Worcester College, 
Oxford and support from the BIPAC institute, Oxford. C. 
J. Lintott acknowledges funding from The Levehulme Trust 
and the STFC Science in Society Program. K. Schawinski 
was supported by the Henry Skynner Junior Research Fel- 
lowship at Balliol College, Oxford. 

Funding for the SDSS and SDSS-II has been provided 
by the Alfred P. Sloan Foundation, the Participating In- 
stitutions, the National Science Foundation, the U.S. De- 
partment of Energy, the National Aeronautics and Space 
Administration, the Japanese Monbukagakusho, the Max 
Planck Society, and the Higher Education Funding Council 
for England. The SDSS Web Site is |http://www.sdss.org/| 
The SDSS is managed by the Astrophysical Research Con- 
sortium for the Participating Institutions. The Participat- 
ing Institutions are the American Museum of Natural His- 
tory, Astrophysical Institute Potsdam, University of Basel, 
University of Cambridge, Case Western Reserve Univer- 
sity, University of Chicago, Drexel University, Fermilab, 
the Institute for Advanced Study, the Japan Participation 
Group, Johns Hopkins University, the Joint Institute for 
Nuclear Astrophysics, the Kavli Institute for Particle As- 
trophysics and Cosmology, the Korean Scientist Group, the 
Chinese Academy of Sciences (LAMOST), Los Alamos Na- 
tional Laboratory, the Max-Planck-Institute for Astronomy 
(MPIA), the Max-Planck-Institute for Astrophysics (MPA), 
New Mexico State University, Ohio State University, Uni- 
versity of Pittsburgh, University of Portsmouth, Princeton 

22 The probability, p , of observing a galaxy merging at any gen- 
eral time is proportional to the probability of its merging at that 
time, p m , multiplied by the time-scale, r, over which it is de- 
tectable, p oc p m T. The probability of merging should only de- 
pend on the system's mass and environment (p m = p m (M,p)) 
whereas the time-scale of detectability, r, must be assumed a 
priori to depend on the internal properties of the merging galaxy 
such as gas content and mass distribution. Our high N B /N e ratio 
therefore suggests that either p m (M,p) or r is large for spirals 
compared to ellipticals. 



Galaxy Zoo: the properties of merging galaxies (I) 15 



University, the United States Naval Observatory, and the 
University of Washington. 



REFERENCES 

Abraham R. G., van den Bergh S., Nair P., et al., 2003, 

ApJ, 588, 218A 
Ball N. M., Loveday J., Fukugita N. M., Nakamura O., 

Okamura S., Brinkmann J., Brunner R. J., 2008, MNRAS, 

348, 1038 

Ball N. M., Loveday J., Brunner R. J., 2008, MNRAS, 383, 
907B 

Bamford S. R et al, 2008, MNRAS, 393, 1324B 

Barton E. J., Arnold J. A., Zentner A. R., Bullock J. S., 

Wechsler R. H., 2007, AJ, 671,1538 
Bell E. P., Phleps S., Somerville R. S., Wolf C, Borch A., 

Meisenheimer K., 2006, ApJ, 652, 270B 
Bernardi M., et al., 2003, AJ, 125, 1817B 
Blanton M. R., Lin H., Lupton R. H., Maley F. M., Young 

N., Zehavi I., Loveday J., 2003, AJ, 125, 2276 
Bond J. R., Cole S., Efstathiou C, Kaiser N., 1991, ApJ, 

379, 440 

Bundy K., Ellis R. S., Conselice C. J., 2005, AJ, 625, 621 
Calzetti D., Armus L., Bohlin R. C, Kinney A. L., Koorn- 

neef J., Storchi-Bergmann T., ApJ, 533, 682 
Capak P., Abraham R. C, Ellis R. S., Mobasher B., Scov- 

ille N., Sheth K., Koekemoer A., 2007, ApJS, 172,284 
Cardamone C. N. et al., 2009. larXiv:0907.4155C 
Carlberg R. G. et al., 2000, ApJ, 532, 1 
Cole S., Lacey C. G., Baugh C. M., Frenk C. S., 2000, 

MNRAS, 319, 168 
Conselice C. J., 2003a, ApJS, 147, 1 

Conselice C. J., Bershady M. A., Dickinson M., Papovich 

C, 2003b, ApJS, 126, 1183 
Conselice C. J., 2006, arXiv:0610662v2 
Conselice C. J., 2006, AJ, 638, 686 

Conselice C. J., Rajgor S., Myers R., 2008, MNRAS, 386, 
909 

Cox T. J., Jonsson P., Somerville R. S., Primack J. R., 
Dekel A., 2008, MNRAS, 384, 386 

Darg D. W. et al., 2009, MNRAS, submitted (D09b in pa- 
per) 

De Lucia G., Springel V., White S. D. M., Croton D., Kauff- 

mann G., 2006, MNRAS, 366, 499 
Di Matteo T., Springel V., Hernquist L., 2005, Nature, 433, 

604 

Dressier A., 1980, AJ, 236, 351 

Ellison S. L., Patton D. R., Simard L., McConnachie A. 

W., 2008, AJ, 135, 1877 
Fakhouri O., Ma C.-P., 2008, MNRAS, 386, 577 
Fukugita M. et al., 2007, A J, 134, 579 
Hopkins A. M. et al., 2003, ApJ, 599, 971 
Hopkins P. F., Cox T. J., Younger J. D., Hernquist L., 

2009, ApJ, 691, 1168 
B. C. Hsieh, H. K. C. Yee, H. Lin, M. D. Gladders, D. G. 

Gilbank, 2008, ApJ, 683., 33H 
Jogee S. et al., 2008. larXiv:0802.3901l 
Jogee S., 2008, arXiv:0408383v2 
Kaviraj S., 2007, larXiv:0801.0127t rl 
Kaviraj S., 2007. larXiv:0710.1311i d 



Keel W. C, Kennicutt R. C. Jr., Hummel E., van der Hulst 

J. M., 1985, AJ, 90, 708 
Kennicutt R. C. Jr., Roettiger K. A., Keel W. C, van der 

Hulst J. M., Hummel E., 1987, AJ, 93, 1011 
Khochfar S. & Burkert A., 2003, AJ, 597, 117 
Lacey C. & Cole S., 1993, MNRAS, 262, 627 
Lahav O., Nairn A., Sodre L. Jr., Storrie-Lombardi M. C, 

1996, MNRAS, 283, 207 
Land K. et al., 2008, MNRAS, 388, 1686L 
Le Fevre O. et al., 2000, MNRAS, 311, 565 
Li C, Kauffmann G., Heckman T. M., White S. D. M., 

Jing Y. P., 2008, MNRAS, 385, 1915 
Lin L. et al., 2008, ApJ, 681, 232L 
Lintott C. J. et al., 2008, MNRAS, 389, 1179L. 
Lintott C. J. et al., 2009, arXiv0906.5304L 
Lisker T., 2008, ApJS, 179, 319L 

Lotz J. M., Primack J. & Madau P., 2004, A AS, 36, 1615 
Lotz J. M. et al., 2008, ApJ, 672, 177 

Lotz J. M., Jonsson P., Cox T. J., Primack J. R., 2008, 

MNRAS, 391, 1137L 
Nakamura O., Fukugita M., Yasuda N., Loveday J., 

Brinkmann J., Schneider D. P., Shimasaku K., SubbaRao 

M., 2003, AJ, 125, 1682 
Oemler A., 1974, ApJ, 194, 1 

Park C, Choi Y.-Y., Vogeley M. S., Gott J. R. Ill, Blanton 

M. R., 2007, ApJ, 658, 898 
Patton D. R., Carlberg R. G., Marzke R. O., Pritchet C. 

J., da Costa L. N., Pellegrini P. S., 2000, ApJ, 536, 153 
Patton, D. R. et al., 2002, ApJ, 565, 208P 
Patton, D. R. & Atfield J. E., 2008, ApJ, 685, 235P 
Salpeter E., 1955, ApJ, 121, 161 
Scarlata C. et al., 2007, ApJS, 172, 406 
Schawinski K., Thomas D., Sarzi M., Maraston C, Kaviraj 

S., Joo S.-J., Yi S. K., Silk J., 2007a, MNRAS, 382, 1415 
Schawinski K. et al., 2009, (in preparation) 
Schweizer F., 2005, ASSL, 329, 143S 

Skibba R. A., Bamford S. P., Nichol R. C, Lintott C. J., 

et al., 2008, MNRAS, submitted 
Slosar A. et al., 2009, MNRAS, 392, 1225S 
Stoughton C. et al., 2002, AJ, 123, 485 
Strauss M. A. et al, 2002, AJ, 124, 1810 
Toomre A. & Toomre J., 1972, ApJ, 178, 623 
Van der Wei, A., 2008, ApJ, 13, 675 
White S. D. M. & Rees M. J., 1978, MNRAS, 183, 341 
Woods D. F. & Geller M. J., 2007, AJ, 134, 527 
York D. G. et al., 2000, AJ, 120, 1579 



APPENDIX A: ESTIMATING THE MERGER 
FRACTION USING SPECTRAL OBJECTS 

Here we clarify some of the subtle issues that arise when 
attempting to estimate the merger fraction of the universe. 
It was claimed in i)3.1l that one could use spectral objects 
alone to obtain an accurate estimate of the merger fraction. 
This is true even though a sizable number of galaxies in the 
universe are strongly-perturbed but do not have spectra due 
to fiber collisions (two spectral targets within 55" cannot 
both obtain spectra in regions with only a single tiling). 
In overlap regio ns, there is near spectral completeness for 
spectral targets (jStrauss et al.|[2002l : iBlanton et al 1 l2003l : see 
Figure \M}. 
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Figure Al. An illustration of the SDSS spectroscopic tiling 
scheme. The overlap region shaded grey covers a fraction k sky 
of the area. Strongly-perturbed systems are coloured with blue 
outlines and single galaxies black. Those targets which receive 
spectra arc marked with a red cross. Here we only consider sys- 
tems where some strongly-perturbed spectral objects are merging 
with spectral targets. 
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Now we let the ratio of binary-merger pairs N m to single 
laxies in the local universe equal p. Taking the ratio of 
12)) and (|A3[) and substituting N m = pN s will give the 
corrective factor, C, relating our fraction to the real fraction: 



C(k sky ,p) 



freal _ 2(1 + pk sky + p) 
fmgs (2p + 1)(1 + k sky ) ' 



(A4) 



When k s ky = 1 (i.e. when the whole sky is an overlap 



region) we get f Tl 



fr, 



For realistic values like k 



sky 



30%, p ~ 5% we get a ratio C(k sky = 0.3, p = 0.05) ~ 1.5 
and we find that (|A4[) is not very sensitive to changes in p. 
Multi-mergers have the affect of increasing C but are rare 
enough to be ignored (only comprising ~ 1% of the merging- 
pairs catalogue) and so we can take our corrective factor to 
be C = 1.5. 



In calculating the merger fraction, we only use spectral 
objects belonging to the Main-Galaxy-Spectroscopic sample 
(MGS) in SDSS. This is the collection of targets identified 
photometric ally as galaxies (but not quasars or lum inous 
red-galaxies: IStoughton et alfeoolllstrauss et al.ll2002l ) with 
r < 17.77 and is intended fo r statistical samplin g of galaxies 
from a uniform population (|Strauss et al.|[2002h . 

Figure IA1I depicts a simple tiling scheme analogous to 
that employed by SDSS (jBlanton et al.ll200St ). The shaded 
area represents the overlap which we say covers a fraction 
k sk y °f the tiled area (for SDSS k sky ~ 30%). Two ob- 
jects within 55" in non-overlap regions suffer from fiber- 
collision so that only one can get measured. Within overlap 
regions virtually all targets get measured due to multiple 
tiling opportunities. Altogether about 6% of all MGS tar- 
gets d o not obtain spectr al measurements due to fiber colli- 
sions (|Strauss et alj|2002h . These are, of course, the objects 
most likely to be of interest in a mergers study though many 
might be projections, so one cannot assume they are part of 
a perturbed system. 

We argue now that our estimate for the fraction of 
'strongly-perturbed' volume-limited MGS objects (fmgs) is, 
upto a correction factor close to unity, a good approximation 
to the 'real' fraction of volume-limited galaxies (freal)- For 
simplicity, let us begin with a sky as depicted in Figure lATl 
involving only galaxies which are bright enough to be spec- 
tral targets and where some of these might be in the process 
of a binary merger such that they are within 55" of another 
spectral target. We assume that mergers are randomly dis- 
tributed with respect to tile-overlap regions. Let the number 
of single (i.e. non-perturbed) galaxies be N s and the number 
of binary-merger pairs be N m ■ The merger fraction that we 
calculated, fmgs, would therefore be 



2k sk yN m + (1 - k skv )Nm 



J S 2kskyN m + (1 - k 8ky )N m + Ns 

Nm(l + k 3k y) , 

N m (l + ksky) + N 3 ' 1 ' 

The real merger fraction, (f rea i) should take account of 
everything regardless of overlap regions so that the fraction 
of strongly perturbed spectral targets is 



285,869 GZ objects 
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